A Details on the Multinomial-dirichlet Model with Correlated Allele Frequencies
نویسندگان
چکیده
Supplementary material for Bioinformatics Application Note: “Correcting for ascertainment bias in the inference of population structure.” Gilles Guillot 1∗ , Matthieu Foll 2,3 1 Centre for Ecological and Evolutionary Synthesis, Department of Biology, University of Oslo. P.O. Box 1066 Blindern 0316 Oslo Norway. 2 Computational and molecular population genetics lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland. 3 Swiss Institute of Bioinformatics.
منابع مشابه
Clustering Images with Multinomial Mixture Models
In this paper, we propose a method for image clustering using multinomial mixture models. The mixture of multinomial distributions, often called multinomial mixture, is a probabilistic model mainly used for text mining. The effectiveness of multinomial distribution for text mining originates from the fact that words can be regarded as independently generated in the first approximation. In this ...
متن کاملProducing Power-Law Distributions and Damping Word Frequencies with Two-Stage Language Models
Standard statistical models of language fail to capture one of the most striking properties of natural languages: the power-law distribution in the frequencies of word tokens. We present a framework for developing statistical models that can generically produce power laws, breaking generative models into two stages. The first stage, the generator, can be any standard probabilistic model, while ...
متن کاملDirichlet negative multinomial regression for overdispersed correlated count data
A generic random effects formulation for the Dirichlet negative multinomial distribution is developed together with a convenient regression parameterization. A simulation study indicates that, even when somewhat misspecified, regression models based on the Dirichlet negative multinomial distribution have smaller median absolute error than generalized estimating equations, with a particularly pr...
متن کاملDirichlet Mixtures, the Dirichlet Process, and the Structure of Protein Space
The Dirichlet process is used to model probability distributions that are mixtures of an unknown number of components. Amino acid frequencies at homologous positions within related proteins have been fruitfully modeled by Dirichlet mixtures, and we use the Dirichlet process to derive such mixtures with an unbounded number of components. This application of the method requires several technical ...
متن کاملOn Eliciting Logistic Normal Priors for Multinomial Models
Multinomial models arise when there is a set of complementary and mutually exclusive categories and each observation falls into one of these categories. Such models are used in many scientific and industrial applications. For example, they are frequently applied to the compositions of rocks in geology, to patterns of consumer selection preferences in microeconomics, and to voting behavior in po...
متن کامل